Statistical investigations

The Ministry is migrating nzmaths content to Tāhurangi.           
Relevant and up-to-date teaching resources are being moved to Tāhūrangi (tahurangi.education.govt.nz). 
When all identified resources have been successfully moved, this website will close. We expect this to be in June 2024. 
e-ako maths, e-ako Pāngarau, and e-ako PLD 360 will continue to be available. 

For more information visit https://tahurangi.education.govt.nz/updates-to-nzmaths

Statistical Investigations: Level 1

The key idea of statistical investigations at level 1 is collecting data as evidence to tell a story about a question of interest.

At level 1 students should be working with survey data that they have collected about themselves and their classmates.  As a class with their teacher they should be starting to use the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle in their investigations.  This involves posing an investigative question that they want to address, collecting and sorting data to answer the question, displaying the data, making statements about what the data shows and answering the question.

Students are typically interested in questions like “who in our class has the most children in their family?”, or “what is the favourite fruit in our class?”.  These pre-summary questions are suitable for students and this level and focus more on an individual than the aggregate of the data.  

Data displays are limited only by the students’ imaginations.  For example, students might show the different types of shoes in the class by taking one shoe from each student and building a display.  Students may well draw individual case plots, for example, they draw a graph of the number of children in family, where the students’ names are on the horizontal axis and the number of children in the family is on the vertical axis. 

Students will be making statements about individuals in their displays. For example, Hemi, Jane and Tiana have four children in their family. These statements need to reflect what the data is showing.

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

  • exploring areas of student interest with the teacher
  • developing investigative questions about categorical data with the teacher,
  • designing how they will gather, sort and count the data with the teacher,
  • collects data with the teacher
  • creating displays to illustrate the data they have collected
    • displays could be individual case plots 
    • displays could be groupings of similar objects
  • making statements about their displays
    • statements are likely to be about individuals
    • statements need to reflect what the data is showing
  • answering the investigative question with the teacher.

Statistical Investigations: Level 2

The key idea of statistical investigations at level 2 is letting go of the individual’s story and moving towards telling the class story.

At level 2 students are building on the ideas from level one and refining their understanding of different aspects of the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle.  A key transition point at this level is moving students’ data display knowledge from individual case plots to frequency plots of the variable of interest.

Investigative questions will be similar to those posed previously and will include categorical and whole number variables.

Data displays become a summary of the individual case plots that they were drawing at the previous level.  For example, the number of children in a family is now on the horizontal axis. Students place themselves on the graph according to the number of children in their family (this could be through using sticky notes).  As each student adds their sticky note to the display the frequency builds up. The frequency is recorded on the vertical axis.

Students will be making summary statements, for example, three students in our class have four children in their family (read the data), or five students have 1 or 2 children in their family (read between the data). 

Teachers should be encouraging students to read beyond the data by asking questions such as: “If a new student joined our class, how many children do you think would be in their family?”

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

  • exploring areas of student interest with the teacher
  • posing investigative questions about categorical and whole number data with the teacher
  • designing how they will collect the data with the teacher
  • collecting data from the class or using secondary data sources to answer the investigative question
    • recording data using a variety of methods, e.g. using data cards, tables, tally charts
    • unpacking multivariate data cards of secondary data before exploring the variables (data cards should have 3-5 variables, e.g. four on the card and then the colour of the card for a fifth variable such as gender or year level)
  • making data displays where the variable of interest in on the horizontal axis and the data points (e.g. dots, cards, sticky notes) represent the frequency vertically (vertical axis if used)
  • making summary statements about the data, connecting it to the group that was investigated
    • reading the data, e.g. three students in our class have four children in their family
    • reading between the data e.g. five students in our class have one or two children in their family
  • being encouraged to read beyond the data by their teacher asking questions such as: “If a new student joined our class, how many children do you think would be in their family?”; 
  • answering the investigative question 

Statistical Investigations: Level 3

The key idea of statistical investigations at level 3 is telling the class story with supporting evidence.

Students are building on the ideas from level two and their understanding of different aspects of the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle.  Key transitions at this level include posing summary investigative questions, and collecting and displaying multivariate and time series data.

Summary or time series investigative questions will be posed and explored.  Summary investigative questions need to be about the group of interest and have an aggregate focus.  For example, what are typical numbers of children in a family for students in our class? What types of fruit do students in our class like?

Data displays build on the frequency plots from level two and can be formalised into dot plots and bar graphs.  Students should be encouraged to show a second variable, for example, by using colour. They may like to look at boys and girls fruit preferences.

Students will be making summary statements, for example, the most common number of  children in a family for our class is three, nine students have three children in their family (read the data), or most students (16 students out of the 27 in our class) have between two and four children in their family (read between the data). 

Teachers should be encouraging students to read beyond the data by asking questions such as: “If a new student joined our class, how many children do you think would be in their family?”  

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

  • identifying a broad area of student interest to explore using the statistical enquiry cycle
  • posing investigative questions about summary (category and whole-number) and time series situations 
    • making predictions/assertions about what they expect to find out 
  • designing how they will collect the data (with teacher guidance)
  • collecting data from the class or a wider group (primary data)
    • recording data systematically
  • using data collected by others (secondary data) 
    • working with secondary data sources that use multivariate data cards or tables to allow a variety of variables to be explored
    • accessing secondary data through software such as CODAP 
      • E.g. using existing data sets available through software tools 
      • using secondary data sets shared by the teacher who has used the software tool as a platform
  • using a variety of data displays e.g. dot plots, bar graphs
    • exploring a second variable, for example, by using colour
    • using statistical software  
  • making summary statements about the data 
    • connecting statements to the group that was investigated
    • identifying patterns in context
      • reading the data, e.g. the most common number of pets for the children in our class is one, two students in our class have 10 or more pets
      • reading between the data e.g. most students (17 out of 28) in our class have one or two pets
    • identifying trends in context
      • reading the data e.g. the height of the seedling after 10 days is 5cm; the number of icecreams sold during summer is 1053
      • reading between the data e.g. there is a repeating pattern with more ice creams sold in summer and less in winter; the seedling appears to be growing quickly initially and after 10 days it is growing more slowly.
  • being encouraged to read beyond the data by answering questions such as: “If a new student joined our class, how many children do you think would be in their family?”; “We missed measuring the height of the seedling over the weekend, what do you think the measurements would have been for Saturday and Sunday?”
  • answering the investigative question 

Statistical Investigations: Level 4

The key idea of statistical investigations at level 4 is telling the class story in detail with supporting evidence.

Students are building on the ideas from level three about different aspects of the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle.  Key transitions at this level include posing comparison and relationship investigative questions, planning investigations including data generation, collecting and displaying measurement data, and comparing distributions visually.

Comparison and relationship investigative questions will be posed and explored.  Comparison investigative questions need to be about the group of interest and have an aggregate focus.  For example, do the boys in our class tend to be taller than the girls in our class? Is there a relationship between arm span length and height for the students in our class?

Students should be planning to collect their own data for the investigative question they have posed.  This includes determining appropriate variables and data collection methods. For example, they need to realise that to answer the first question they will need to measure student’s heights.  Along with this they will need to think about what units to measure with and whether students should leave their shoes on or not, and who will take the measures. This is data generation.  

Students should be using dot plots and scatter plots to display data.  When comparing dot plot distributions visually they can identify the middle group by circling it and reason about the placement of the middle groups (shift) relative to one another.   They can compare (approximate) centres and the variation of the data in the middle groups. Students can use tools such as hat plots and any statistical software that is available. For scatter plots students should be looking at features such as the trend of the data points and how close the points are to the trend.  Adding a third variable, for example gender, by using colour allows for further exploration. 

Students should be writing statistically sound statements about what their displays show.  The starter “I notice...” is a useful way to encourage students to write about what their displays show.  In addition, students should be encouraged to write “I wonder...” statements for further investigation.

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

  • identifying broad areas to explore using the statistical enquiry cycle
  • posing investigative questions about summary, comparison, relationship and time series situations 
    • making predictions/assertions about what they expect to find out 
  • planning for data collection using surveys 
    • determining variables needed to answer investigative questions
    • planning how to collect data for each variable 
      • posing survey questions
      • deciding how to make accurate measures when needed
    • collecting data from the class or a wider group (where all the wider group can be surveyed)
  • using data collected by others - secondary data sources
    • interrogating the data e.g. what were the survey questions posed? Who was the data collected from? How was the data collected? What is the variable? How was it measured?
  • exploring summary, comparative, bivariate and time series data
    • using multiple representations to analyse and display data
    • using statistical software to analyse and display data 
    • exploring measurement and categorical data
    • describing distributions (summary and comparison - category and quantitative), relationships (bivariate), and trends (time series)
  • communicating findings using the entire statistical enquiry cycle
    • answering the investigative question using evidence from their analysis

Statistical Investigations: Level 5

The key idea of statistical investigations at level 5 is telling a story about the wider universe with supporting evidence.

Students are building on the ideas from level four about different aspects of the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle.  The key transition at this level is the acknowledgement that samples can be used to answer questions about populations.

Students will be posing investigative questions about populations and using samples to answer these.

Students need to realise that the data collected may have to be cleaned.  To help with this they should be familiar with the survey questions posed, who the data was collected from and how the data was collected.  Once data is identified as needing cleaning, strategies on how to do this should be discussed, for example, whether the value removed, or “cleaned”.  

Students may need to recategorise data into broader categories or smaller categories depending on the question they are trying to answer.  They will be looking at patterns and trends in displays and using these to answer their investigative question. Thinking routines such as: “What information can you get from this plot?” and “What evidence do you have for saying that?” will be helpful for students.

Students will be starting to use informal methods to make comparisons between sample distributions using box plots and growing their reasoning about sampling variability, shape, spread, unusual and interesting features, and making a call. See CensusAtSchool 2009 Teachers Day

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

  • posing investigative questions about populations for summary and comparison situations
  • posing investigative questions for relationship and time series situations
  • planning for data collection using surveys
    • determining the variables needed to answer investigative questions
    • considering sources of variation
    • deciding how to measure variables
    • considering ethics
    • posing data collection questions 
    • collecting data from a sample
      • samples likely to be convenience e.g. class; or ones that are generated using random sampling e.g. census at school samples
      • sampling design not expected at this level  
    • cleaning data
  • using data collected by others
    • interrogating the data e.g. What were the survey questions posed? Who was the data collected from? How was the data collected? What is the variable? How was it measured?
    • considering data ethics, privacy, ownership, data quality of the second hand data
    • cleaning data if needed
  • exploring summary, comparative, bivariate and time series data
    • using multiple representations to analyse and display data
    • using technology to analyse and display data
    • exploring measurement and categorical data
    • recategorising data as needed to answer the investigative question
    • describing distributions of samples (summary and comparison), relationships of group (bivariate) and  trends (time series) 
    • using measures of centre (e.g. median), spread (e.g. IQR), proportion (e.g. proportion of girls who walk to school)
    • finding the quadrant count ratio (bivariate)
    • using visual evidence to communicate features in context
    • “making the call” when using samples to answer investigative questions about populations
    • answering the investigative question using evidence
  • presenting a report of findings using the whole statistical enquiry cycle
    • integrating statistical and contextual information
    • providing an explanation or interpretation for findings
    • justifying findings

At this level learning experiences should include students using the statistical enquiry cycle to conduct experiments by:

  • posing investigative questions that can be answered using experiments
  • planning the experiment
    • determining control and response variables
    • designing experiments to collect data to investigate the situation
  • undertaking the experiment
  • using multiple representations to display results of experiments
  • presenting a report of their findings using the whole statistical enquiry cycle

Statistical Investigations: Level 6

The key idea of statistical investigations at level 6 is telling a story about a wider universe, taking variation and uncertainty into account, with supporting evidence.

Students are consolidating and refining their ideas about different aspects of the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle.  Key transitions at this level include the integration of statistical and contextual knowledge to answer the investigative questions and making informal inferences about populations from samples. 

Students should be justifying variables and measures used in the data collection phase and thinking about the possible underlying population distributions for the variables of interest.

In the analysis phase students should be using multiple displays to show different features of the sample distributions.  Key features of the sample distributions should be discussed; integrating statistical and contextual information. Students will confidently be using informal methods to make comparisons about populations using sample distributions including reasoning about shift, overlap, sampling variability and sample size. 

Students should be reflecting on their findings and how this fits with real world experiences.

At this level learning experiences should include students using the statistical enquiry cycle to conduct investigations by:

See senior secondary guides S6-1 for more detail.

Statistical Investigations: Level 7

The key idea of statistical investigations at level 7 is creating and telling a story about a wider universe, considering sampling variability and sample size effects, with supporting evidence.

Students are using the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle in different data collection contexts.  A key focus at this level is the use of surveys and associated random sampling techniques. Key transitions at this level include the integration of statistical and relevant contextual knowledge to answer investigative questions and making informal estimated comparison intervals for population parameters from samples. 

At this level students will be aware of how to design a suitable questionnaire specific to a given purpose and they will be using random sampling techniques in the data collection phase.  They should be evaluating choice of measures for variables and sampling and data collection methods. Students should be provided with relevant contextual knowledge about the situation under investigation. 

Students will confidently be using informal estimated comparison intervals to make comparisons about populations from sample distributions.

See senior secondary guides S7-1 and senior secondary guides S7-2 for more detail.

Statistical Investigations: Level 8

The key idea of statistical investigations at level 8 is creating, modelling and telling a story about a wider universe, supporting this with sophisticated statistical techniques and informed contextual knowledge.

Students are using the PPDAC (Problem, Plan, Data, Analysis, Conclusion) cycle in different data collection contexts.  A key focus at this level is the use of experimental design principles. Key transitions at this level include:

  • the integration of statistical and informed contextual knowledge to answer investigative questions,
  • using appropriate statistical models,
  • making statistical inferences about populations or processes from samples using methods such as bootstrapping or randomisation to determine estimates, confidence intervals, forecasts, and strength of evidence, and
  • evaluating all stages of the cycle. 

Students will be sourcing relevant contextual knowledge about the situation under investigation from places such as the internet, the school or local library, newspapers and magazines.

See senior secondary guides S8-1 and senior secondary guides S8-2 for more detail.